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Expressi n of Glyc syltransf rase in Aspergillus 
Technical Fi Idofth Invention 

The present invention is directed to increased secretion of desired polypeptides (particularly 
glycosyltransferases) from filamentous fungi (particularly Aspergillus). The invention discloses 
DNA sequences, vectors, fusion polypeptides, and processes for obtaining enhanced 
production and secretion levels of the desired polypeptide. More particularly, the invention 
discloses DNA sequences, vectors, fusion polypeptides and processes for the increased 
secretion of sialyltransferase, galactosyltransferase and fucosyltransferase in Aspergillus, 
preferably A. niger war. awamori. 

Background of the Invention 

One of the earlier successes of recombinant DNA technology involved the intracellular 
expression of the A and B chains of insulin in bacteria as carboxyl fusions to p-galactosidase. 
Goeddel, D.V. etal. (1979) Proc. Natl. Acad. Sci. USA 76:106-110: Johnson, I.S. (1983) 
Science 219:632-637. Since then, numerous examples have been described for the expression 
of fusion polypeptides comprising, in part, a heterologous polypeptide. Marston, F.A.O. (1986) 
Biochem. J. 240:1-12 summarizes the production of heterologous polypeptides in E. coli. As 
described therein, a number of heterologous polypeptides have been expressed intracellularly 
as fusion polypeptide in E. coli In addition, heterologous polypeptides have reportedly been 
secreted into the periplasmic space of such microbes by fusing the heterologous polypeptide 
with a signal sequence. In some cases, the heterologous polypeptide was secreted from E. coli 
into the culture medium when expressed with a signal sequence of bacterial origin. In those 
cases where a heterologous protein has been expressed as a fusion with the entire native 
protein of the host bacteria, the rationale was primarily to increase stability or ease the 
purification of the fusion polypeptide. 

For example, Scholtissek, S. et al. (1988) Gene 62:55-64, report the expression in E. coli of a 
triprotein consisting of bacterial p-galactosidase, a collagenase recognition site and the single 
stranded DNA binding protein from E. coli. The p-galactosidase portion of this fusion 
polypeptide reportedly was used to purify the fusion polypeptide from a crude cell lysate by 
affinity chromatography on APTG-Sepharose. The single stranded DNA binding protein from £. 
coli was thereafter isolated from the fusion polypeptide by reacting the collagenase recognition 
site with collagenase. Similarly, Smith, D.B. et al. (1988) Gene 67:31-40 report the bacterial 
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expression of a vector ncoding a fusion polypeptide consisting of glutathione S-transferase 
fused at its C-terminus with a recognition site for blood coagulation factor Xa which itself is fused 
to either of two het rologous polypeptides corresponding to diff rent antig ns f P. falciparum. 
In another example, Guan, C. et al. (1988) Gene 67:21-30 report the expression and purification 
of fusion polypeptides consisting of maltose binding protein fused either to p-galactosidase or 
Pstl endonudease, and a fusion protein consisting of the bacterial phoA signal, maltose binding 
protein and phoA protein. In the former cases, the fusion polypeptides were extracted from 
crude bacterial lysates by affinity chromatography on cross-linked amylose whereas, in the 
latter, the fusion protein was obtained from the periplasmic space after spheroplast formation 
and affinity chromatography on cross-linked amylose. 

The expression of fusion polypeptides in yeast has also been reported. For example, Cousens, 
L.S. et al. (1987) Gene 61:265-275 describe a fusion polypeptide consisting of a human 
superoxide dismutase-human proinsulin fusion protein with a methionine residue at the junction 
of the two proteins. Superoxide dismutase is an intracellular protein and the fusion polypeptide 
was reportedly expressed as an insoluble inclusion body within the yeast expression host with 
incorrect disulfide bonds. After sulfitollysis proinsulin was reportedly purified, renatured and 
processed to yield insulin after cleavage of the methionine residue with cyanogen bromide. 

United States Patent No. 4,751,180 to Cousens et al. states that a polypeptide of interest may 
be obtained in high yield from an expression host, such as yeast, when the polypeptide of 
interest is expressed as a completely heterologous fusion polypeptide. One of the heterologous 
polypeptides is produced in high yield in the expression host typically in amounts greater than 
five percent of the total protein produced by the host. The only high yield heterologous 
polypeptide disclosed, however, is that of the intracellular protein human superoxide dismutase 
which is fused to either proinsulin or lgF-2. The specification also states that a secretory leader 
and processing signal may be included as part of the fused polypeptide. No example is 
provided which indicates that secretion would be obtained and, if obtained, would be at levels 
higher than that which have been obtained using a fusion construction which detected the high 
yield heterologous protein in a fusion consisting of a secretory leader sequence fused to only 
the polypeptide of interest e.g., proinsulin or insulin-like growth factor (lgF-2). 

Heterologous gene expression has also been reported in filamentous fungi. For example, 
Christensen, T. et al. (1988) Bio/Technoloov 6:1419-1422 have reported an expression vector 
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utilizing the a-amylase promotor from A. oryzae to express th prepro form f aspartyl 
prot inas fromth filamentous fungus Rhizomuchor miehei. Wh n express d in A. oryzae, 
aspartyl proteinase was obtained from the culture medium. Gwynne, D.I. et al. (1987) 
Bio/Technology 5:713-719 report the expression and secretion of human interferon and 
bacterial endoglucanase from filamentous fungi by expressing these genes with either a fungal 
glucoamylase signal or a synthetic consensus signal sequence. 

US Patent 5,364,770 and BP Patent 0 215 594, both incorporated herein by reference, teach 
the expression and secretion of various heterologous polypeptides, such as chymosin, in 
filamentous fungi, and particularly Aspergillus. 

Upshall, A. et al. (1987) Bio/Technology 5:1301-1304 report the expression and secretion of 
human tissue plasminogen activator by expressing the gene encoding the pre-form of t-PA in a 
filamentous fungus. Further, Tumbull, I.F. et al. (1989) Bio/Technology 7:169-174 report an 
attempt to express and secrete bacterial enterotoxin subunit B from filamentous fungi. No 
secreted material, however, was detected. 

Bovine prochymosin has reportedly been expressed in Escherichia colt, the yeasts 
Saccharomyces cerevisiae and Yarrowia lipolytics, and in filamentous fungi by the inventor in 
Aspergillus species. In E. coli prochymosin, with the first four amino acid residues replaced by 
an amino-terminal fragment of the trpE gene, has reportedly been produced under the control of 
the trp promoter (Nishimori, K. et al. (1984) Gene 29:41-49). The fusion protein accumulated as 
inclusion bodies in the cytoplasm but after appropriate extraction conditions could be activated 
to yield mature chymosin. 

Moir et al. (1985) (in Developments in Industrial Microbiology, Vol. 26, Underkofler, LA. (ed.), 
Society for Industrial Microbiology, Arlington, VA, USA) described intracellular production of 
prochymosin in S. cerevisiae. The protein was synthesized with various segments of 
phosphoglycerate kinase, triosephosphate isomerase or galactokinase attached to the amino 
terminus, allowing increased production compared to direct expression from the same 
promoters. It was suggested that the increase in production was due to more efficient 
translation of the mRNA. Moir et al. also reported secretion of prochymosin from S. cerevisiae, 
in the form of a fusion with the first few residues of invertase or alpha factor. The extracellular 
prochymosin was activated at low pH to give mature chymosin despite the additional amino 



WO 96/36718 PCT/US96/07171 

- 4 _ 

acids on the prosequ nee. Similarly, activatabl prochymosin was s creted from th yeast Y. 
lipolytics with either 14 or 90 residues of native alkaline extracellular protease attached to the 
amino terminus (Franke, A.E. et al. (1988) in Developments in Industrial Microbiology, Vol. 29, 
Pierce, G. (ed.), Society for Industrial Microbiology, Arlington, VA, USA). In this report, no more 
than about 20% of the amino terminus of the protease was used to generate the fusion 
polypeptides and no apparent advantage accrued from expression as fusion polypeptides. 
Active calf chymosin has also been produced in the filamentous fungus Trichoderma reesei 
(Harkki, A. et al. (1989) Bio/Technology 7:596-603. The cellobiohydrolase I gene (cbhf) 
promoter and terminator regions were employed and four different constructions were made 
employing different signal sequences fused to prochymosin cDNA. Either the chymosin signal 
sequence, cbhf signal sequence, a hybrid c/>/?//chymosin signal sequence or the cbhl signal 
sequence plus 20 amino acids of mature cbhl were fused to the amino terminus of 
prochymosin. Slightly better production was obtained from the latter construction although 
insufficient numbers of transformants were examined to confirm this. Secretion was inefficient 
with approximately 66% of the chymosin-derived material remaining within the cell of 
transformants regardless of the type of vector construction used. 

The glaA gene encodes glucoamylase which is highly expressed in many strains of Aspergillus 
n/gerand Aspergillus awamori. The promoter and secretion signal sequence of the glaA gene 
have been used to express heterologous genes in Aspergilli including bovine chymosin in 
Aspergillus nidulans and A. awamori as previously described by Cullen, D. et al. (1987) 
Bio/T echnoloqy 5:71 3-71 9. In the latter experiments, a variety of constructs were made, 
incorporating prochymosin cDNA, either the glucoamylase or the chymosin secretion signal and, 
in one case, the first 1 1 codons of mature glucoamylase. Maximum yields of secreted chymosin 
obtained from A. awamori were below 15 mg/L in 50 ml shake flask cultures and were obtained 
using the chymosin signal sequence encoded by pGRG3. These previous studies indicated 
that integrated plasmid copy number did not correlate with chymosin yields, abundant 
polyadenylated chymosin mRNA was produced, and intracellular levels of chymosin were high 
in some transformants regardless of the source of secretion signal. It was inferred that 
transcription was not a limiting factor in chymosin production but that secretion may have been 
inefficient It was also evident that the addition of a small amino-terminal segment (1 1 amino 
acids) of glucoamylase to the propeptide of prochymosin did not prevent activation to mature 
chymosin. The amount of extracellular chymosin obtained with the first eleven codons of 
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glucoamylase, howev r, was substantially less than that obtained wh n the glucoamylas signal 
was used alone. 

USSN 08/318,491 and EP Application WO90/15860, incorporated herein by reference, disclose 
the enhanced secretion of desired polypeptides by and from filamentous fungi including fusion 
DNA sequences, expression vectors containing such DNA sequences, transformed filamentous 
fungi, fusion polypeptides and processes for expressing and secreting high levels of such 
desired polypeptides. Specifically described is the enhanced secretion of chymosin from 
filamentous fungi including fusion DNA sequences, vectors containing such DNA sequences, 
transformed filamentous fungi, fusion chymosin polypeptides and processes for expressing and 
secreting high levels of chymosin. 

Despite these publications regarding the expression and secretion of various polypeptides from 
various filamentous fungi, including the production of desired polypeptides in filamentous fungi 
as fusions, there is a need to provide enhanced production of certain desired polypeptides, 
particularly glycosyltransferase enzymes, from Aspergillus, and particularly A. niger. The 
tetrasaccharide sialyl Lewis x serves as a ligand for a family of cell adhesion molecules, known 
as selectins, which mediate the recruitment of neutrophils and other leucocytes to sites of 
inflammation and tissue injury. An oligosaccharide mimicking sialyl Lewis x ((Cylexin® to be 
commercialized by Cytel, San Diego, CA) is being synthesized by combined chemical and 
enzymatic means for testing as a treatment for inflammatory diseases. Enzymatic synthesis 
involves using glycosyltransferases such as a-2,3-(N) sialyltransferase (ST), p-(1,4) galactosyl 
transferase (GT) and a-(1,3) fucosyltransferase (FT) in conjunction with other enzymes that 
regenerate the donor sugar nucleotide substrate. Commercial scale production of the enzymes 
GT, FT and ST is difficult and, therefore, the present inventors have proposed to produce such 
transferases as fusions in an Aspergillus system. Using a-2,3-(N) sialyltransferase as a 
prototype for glycosyttransferase production, a-2,3-(N) sialyltransferase activity was tested in a 
variety of systems, including baculovirus, mammalian cells and filamentous fungus, using 
multiple vector constructs. The filamentous fungus Aspergillus niger var. awamori, 
advantageously, was found to produce consistent, high levels of <x-2,3-(N) sialyltransferase that 
facilitates cost effective synthesis of the complex carbohydrates. 

Accordingly, an object of the present invention is to provide for the expression and enhanced 
secretion of certain glycosyltransferases in Aspergillus, including fusion DNA sequences, 
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xpression v ctors, transform d host cells, fusion transf ras s and processes for xpressing 
and secreting high lev Is of such transferases. Glycosyltransferases of eukaryotes, which are 
involved in the glycosylation of proteins, are normally located within the secretory apparatus of 
the cell. They typically possess a transmembrane anchor sequence which serves to localize 
them on a membrane. The transmembrane coding sequence is preferably deleted from the 
glycosyltransferase encoding DNA sequence for expression in Aspergillus. 

The references discussed above are provided solely for their disclosure prior to the filing date of 
the instant case. Nothing herein is to be construed as an admission that the inventors are not 
entitled to antedate such disclosure by virtue of prior invention or priority based on earlier filed 
applications. 

Summary of the Invention 

In accordance with the above objects, the invention includes novel fusion DNA sequences 
comprising from the 5' terminus: 1) a DNA sequence encoding a signal peptide functional as a 
secretory sequence in Aspergillus; 2) a DNA sequence encoding a secreted polypeptide or 
portion thereof which is normally secreted from a filamentous fungus, and particularly from 
Aspergillus; 3) optionally a DNA sequence encoding a cleavable linker polypeptide; and 4) a 
DNA sequence encoding a desired glycosyltransferase polypeptide. When the fusion DNA 
sequence is expressed in Aspergillus, increased secretion of the desired polypeptide is obtained 
as compared to that which is obtained when the desired polypeptide is expressed from DNA 
sequences encoding a fusion polypeptide which does not contain the second polypeptide 
normally secreted from an Aspergillus host or as compared to the production of such 
glycosyltransferase in other expression systems such as baculovirus. 

The invention also includes expression vectors containing the above fusion DNA sequence and 
Aspergillus strains transformed with such expression vectors. The invention also includes the 
fusion polypeptide encoded by such fusion DNA sequences. 

Further, the invention includes a process for producing a desired glycosyltransferase 
polypeptide comprising transforming a host Aspergillus strain with the above described 
expression vector and culturing the host strain to secrete the desired polypeptide into the culture 
medium. 
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Figure 1 shows a diagram of the 3-D organization of Aspergillus glucoamylase. 

Figures 2 A-E show the construction of pGAST and pGASTK, including Seq ID Nos. 1 and 2 in 
Figure 2A; and Seq ID Nos. 3-6, respectively, in Figure 2B. 

Figures 3 A-E show the construction of pGADVST and pGADVSTK, including Seq ID Nos. 7-10, 
respectively, in Figure 3A. 

Figure 4 shows Sheftone N concentration effect 

Figure 5 shows effect of media pH after inoculation. 

Figure 6 shows effect of sodium citrate. 

Figure 7 shows the map of pGAKHi+, an intermediate vector for FT expression. 
Figure 8 shows the construction of pGAKHi+FT. 

Figure 9 shows the effect of agitation speed on FT activity in KK17 transformant 
Figure 10 shows the effect of pH on FT activity in KK17 transformant 
Figure 1 1 shows the construction of pVL1 393-FT. 
Detailed Description 

The inventors have discovered that desired glycosyltransferase polypeptides can be expressed 
and secreted at levels higher than that previously obtained from other expression systems such 
as baculovirus or mammalian cells by expressing the transferase as a fusion of the transferase 
with a polypeptide which is normally secreted from an Aspergillus host system. 

Many glycosyltransferases useful in the present invention have been cloned. For example, 
those listed in the following Table have been cloned and described in the literature. 
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Cloned Glvcosvltransferases 

I. Fucosvltransferases E.C. Numb r Reference 
a1,2 2.4.1.69 (1) (2) (3) 
a1,3/4(FTIII) . * - (4) 
a1,3(FTIV) - (5) (6) (7) (8) 
a1,3(FTV) - (9) 
a1 ( 3(FTVI) - (10) (11) 
a1,3(FTVII) - (12) 

II. Galactosvltransferases 

a1,3 2.4.1.151 (13) (14) (15) 

a1,3 - _ (16) 

p1,4 2.4.1.38 (17) -(21) 

III. N-Acetvlqalactosaminvltransferases 

a1 t 3 - (22) (23) 

p1 t 4 - (24) (25) 

pi ,4 (CT antigen) - (26) 

ct1 ,Thr polypeptide - (27) (28) 

IV. N-Acetvlqlucosaminvltransferases 

pi,2GnTI . 2.4,1.101 (29) -(32) 

pi,2GnTII - (33) 

p1,4GnTIII 2.4.1.144 (34) (35) 

pi,6GnTV - (36) (37) 

p1,6GnT(Olinked) - (38) 

pi,6GnT - (39) 

GPT - (40)- (43) 

Hyaluronan synthetase - (44) 

V. Sialvltransferases 

a2 ( 6(N) 2.4.99.1 (45) -(50) 

a2,6 (O) 2.4.99.3 (51) 

o2,6 (O) - (52) 

o2,3(N) 2.4.99.6 (53) (54) 

a2 t 3(0) 2.4.99.4 (55) (56) 

o2,3 (O) - (57) 
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o2,3 (O/N) 



STX 



o2,8 



(58) (59) 
(60) (61) 
(62) 



VI. Glucosvltransferases 



Alg8 
Atg5 



(63) 
(64) 



VII. Mannosvltransferases 
a1,2 

a1,3(Alg2) 
pi,4(Alg1) 



(65) 
(66) 
(67) 
(68) 
(69) 
(70) 



Dpm1 
Och1 



Pmt 
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By using the fusion DNA constructions in Aspergillus as described herein, a dramatic increase in 
extracellular glycosyltransferase has been obtained versus expression in other systems. In 
some cases expression levels are approximately 10 fold in excess of that obtained in other 
expression systems. 

The increased secretion levels of glycosyltransferase are the result of expressing the 
transferase as a fusion polypeptide with a polypeptide normally secreted by a filamentous 
fungus. In the preferred embodiments, glucoamylase from A. niger var. a wamori encoded by 
the glaA gene, including the secretory signal sequence of glucoamylase, is fused to the amino 
terminus of the transferase from which the transmembrane anchor region has been deleted. 
The presence of the glucoamylase signal sequence and all or part of mature glucoamylase 
peptide sequences facilitate the enhanced secretion of the fusion polypeptide into the culture 
medium. 

As used herein, a "fusion DNA sequence" comprises from 5' to 3* DNA encoding: a signal 
peptide; a secreted polypeptide or a portion thereof; optionally a cleavable linker polypeptide; 
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and a desired glycosyltransferase. Specifically, the fusion DNA comprises DNA ncoding a 
signal peptid functional as a s cretory sequence in Aspergillus. Such signal sequences 
include those from glucoamylase, a-amylase and aspartyl proteases from Aspergillus awamori, 
Aspergillus niger, Aspergillus oryzae, signal sequences from cellobiohydrolase I, 
ceilobiohydrolase II, endoglucanase I, endoglucanase II from Trichoderma, signal sequences 
from glucoamylase from Neurvspora and Humicola, as well as signal sequences from 
eukaryotes, including the signal sequence from bovine chymosin, human tissue plasminogen 
activator, human interferon and synthetic consensus eukaryotic signal sequences such as that 
described by Gwynne (1987) supra. Particularly preferred signal sequences are those derived 
from polypeptides secreted by the expression host used to express and secrete the fusion 
polypeptide. For example, the signal sequence from glucoamylase from Aspergillus awamori is 
preferred when expressing and secreting a fusion polypeptide from Aspergillus awamori. 

In addition, the fusion DNA sequence comprises DNA encoding "secreted polypeptides" 
normally expressed from filamentous fungi. Such secreted polypeptides include but are not 
limited to glucoamylase, a-amylase and aspartyl proteases from Aspergillus awamori, 
Aspergillus niger, and Aspergillus oryzae, cellobiohydrolase I, cellobiohydrolase II, 
endoglucanase I and endoglucanase II from Trichoderma and glucoamylase from Neurvspora 
species and Humicola species. As with the signal sequences, preferred secreted polypeptides 
are those which are naturally secreted by the Aspergillus expression host. Thus, for example, 
when using Aspergillus awamori, preferred secreted polypeptides are glucoamylase and a- 
amylase from Aspergillus awamori, most preferably glucoamylase. 

Fusion DNA sequences may further comprise DNA sequences encoding a cleavable linker 
polypeptide. Such sequences include those which encode the prosequence of bovine 
chymosin, the prosequence of subtilisin, prosequences of retroviral proteases, including human 
immunodeficiency virus protease and DNA sequences encoding amino acid sequences 
recognized and cleaved by trypsin, factor Xa collagenase, clostripin, subtilisin, chymosin, fungal 
KEX2 protease and the like. See, e.g.. Marston. F.A.Q. (19861 Biol. Chem. J. 240:1-12. Such 
linker sequences may also encode the amino acid methionine which may be selectively cleaved 
by cyanogen bromide. It should be understood that the linker sequence need only encode that 
amino acid sequence which is necessary to be recognized by a particular enzyme or chemical 
agent to bring about cleavage of the fusion polypeptide. Thus, the entire prosequence of, for 
example, chymosin or subtilisin need not be used. Rather, only that portion of the prosequence 
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which is necessary for recognition and cleavage by th appropriate nzyme is required. For 
example, the linker may comprise a recognition sequence (Lys-Arg) for the key protease which 
sequence may be ins rt d at the fusion junction as in pGASTK. 

Furthermore, the fusion DNA sequence comprises a "desired glycosyltransferase polypeptide." 
Such desired polypeptides include any glycosyltransferase as provided in the Table above. 
Preferred glycosyltransferases useful in the present invention include a-(2,3) sialyltransferase; 
P-(1,4) galactosyltransferase and a-(1,3) fucosyltransferase. 

The above-defined DNA sequences encoding the corresponding amino acid sequences are 
combined to form a "fusion DNA sequence." Such fusion DNA sequences are assembled in 
proper reading frame from the 5' terminus to 3* terminus in the order presented above. As so 
assembled, the DNA sequence will encode a "fusion polypeptide" encoding from its amino 
terminus a signal peptide functional as a secretory sequence in Aspergillus, a secreted 
polypeptide or portion thereof normally secreted from Aspergillus, a cleavable linker polypeptide 
and a glycosyltransferase. 

As indicated, the DNA encoding a signal peptide is functional as a secretory signal in an 
Aspergillus regardless of whether the signal peptide is from an Aspergillus or from some other 
source such as the chymosin signal from bovine species. The signal sequences may be 
derived from a secreted polypeptide from a particular species of Aspergillus, 
As indicated, all or part of the mature sequence of the secreted polypeptide is used in the 
construction of the fusion DNA sequences. In a preferred embodiment of the present invention, 
the secreted polypeptide is glucoamylase from A. niger or A. niger var. awamori. 

Aspergillus n/gerand Aspergillus niger var. awamori {A. awamori) glucoamylases have 
identical amino acid sequences. The glucoamylase is initially synthesized as 
preproglucoamylase. The pre and pro regions are removed during the secretion process 
so that mature glucoamylase is released to the external medium. Two forms of mature 
glucoamylase are recognized in culture supernatants, GAI which is the full-length form 
(amino acid residues 1-616) and GAll which is a natural proteolytic fragment comprising 
amino acid residues 1-512. GAI is known to fold as two separate domains joined by an 
extended linker region. The two domains are the 471 residue catalytic domain (amino 
acids 1-471) and th 108 residue starch binding domain (amino acids 509-616), the linker 
region being 36 residues in length (amino acids 472-508). GAll lacks the starch binding 
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domain. These details of glucoamylas structur are reviewed by Libby et al. (Protein 
Engineering 7: 1 1 09-1 11 4, 1994) and are shown in diagrammatic form in Fig. 1. 

Desired polypeptides have previously been produced in filamentous fungi as fusions with a 
naturally secreted fungal polypeptide. In each case where a comparison has been made 
between production of desired polypeptide as a fusion with a naturally secreted fungal 
polypeptide versus production of the desired polypeptide alone (i.e., not fused to another 
polypeptide), a significant increase in yields of secreted desired polypeptide resulted from 
production as a fusion protein. Bovine chymosin and porcine pancreatic prophospholipase 
A 2 have both been produced in A. niger or A. n/gervar. awamori as fusions to full-length 
GAI (USSN 08/318,494; Ward et al.. Bio/Technology 8:435-440,1990; Roberts et al.. Gene 
122:155-161, 1992). Human interleukin 6 (hlL6) has been produced in A. nidulans as a 
fusion to full-length A. niger GA\ (Contreras et al., Biotechnology 9:378-381, 1991). 
Fusions of desired polypeptides with truncated forms of glucoamylase have also been 
produced. Unpublished work by the inventors has shown that fusion of bovine 
prochymosin to glucoamylase residues 1-501 leads to similar yields of chymosin as fusion 
of prochymosin to full-length glucoamylase. Hen egg white lysozyme (Jeenes et al., FEMS 
Microbiol. Lett. 107:267-272, 1993) and human lactoferrin (Ward et al., Bio/Technolog y 
13:498-503, 1995) have been produced in A. nigeras fusions to residues 1-498 of 
glucoamylase and hlL6 has been produced in A. nigeras a fusion to glucoamylase 
residues 1-514 (Broekhuijsen et al., J. Biotechnol. 31:135-145, 1993). In some of the 
above experiments (Contreras et al., 1991; Broekhuijsen et al., 1993; Ward et al, 1995) a 
KEX2 protease recognition site (Lys, Arg) has been inserted between glucoamylase and 
the desired polypeptide to allow in vivo release of the desired polypeptide from the fusion 
protein as a result of the action of a native Aspergillus KEX2-like protease. 

Additionally, bovine chymosin has been produced in A. nigeras a fusion with full-length 
native alpha-amylase (Korman et al., Curr. Genet. 17:203-212, 1990) and in A. oryzae as a 
fusion with truncated forms of A. oryzae glucoamylase (either residues 1-603 or 1-511; 
Tsuchiya et al., Biosci. Biotech. Biochem. 58:895-899, 1994). 



Trichoderma reesei produces several celiulase enzymes, including cellobiohydrolase I 
(CBHI), which are folded into two separate domains (catalytic and binding domains) 
separated by an extended linker region. Foreign polypeptides have been secreted in T. 
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reese/ as fusions with the catalytic domain plus linker region of CBHI (Nyyssonen et al.. 
Rio/T chnoloov 11:591-595, 1993). 

Without intending to be limited to a particular theory, applicants suggest that the amount of 
glucoamylase or other naturally secreted fungal polypeptide which is needed to enhance 
the production of a desired polypeptide can be surmised based on the known 3-D structure 
of glucoamylase. In the case of bovine chymosin production by A. nlger var. awamori it has 
been shown that fusion of prochymosin following the 11th amino acid of mature 
glucoamylase provided no benefit compared to production of preprochymosin (US Patent 
5,364,770). The inventors have shown (USSN 08/318,494) that fusion of prochymosin 
onto the C-terminus of preproglucoamylase up to the 297th amino acid of mature 
glucoamylase plus a repeat of amino acids 1-11 of mature glucoamylase yielded no 
secreted chymosin in A. n/gervar. awamori. In the latter case it is unlikely that the portion 
(approximately 63%) of the glucoamylase catalytic domain present in the fusion protein was 
able to fold correctly so that an aberrant, mis-folded and/or unstable fusion protein may 
have been produced which could not be secreted by the cell. The inability of the partial 
catalytic domain to fold correctly may have interfered with the folding of the attached 
chymosin. Thus, it is likely that sufficient residues of a domain of the naturally secreted 
polypeptide must be present to allow it to fold in its normal configuration independently of 
the desired polypeptide to which it is attached. Evans et al. (gene 91:131-134. 1990) 
showed that a form of A. niger glucoamylase truncated at its C-terminus back to amino acid 
460 in the catalytic domain was not efficiently secreted by yeast. However, the relevance 
of this to secretion in Aspergillus is not known. 

A linker region between the desired polypeptide and the domain of the naturally secreted 
fungal polypeptide may be beneficial in allowing the two polypeptides to fold independently. 
However, it may be possible to dispense with the linker region altogether. In the examples 
above where a foreign polypeptide is fused to the C-terminus of full-length glucoamylase or 
alpha-amylase, there is no linker separating the desired polypeptide from the C-terminus of 
the fungal polypeptide. Linkers other than the glucoamylase or CBHI linkers mentioned 
above could be used to connect a domain of a naturally secreted fungal polypeptide and 
the desired polypeptide. The salient features of a linker region are that it forms an 
extended, semi-rigid spacer betw en independently folded domains. Such linker regions 
are found in several proteins, especially hydrolases such as bacterial and fungal cellulases 
and hemicellulases (reviewed in Libby et al. 1994). 
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Based on the above, either the full length s creted polypeptide or a functional portion thereof 
may be employed in the present invention. As used herein a "portion" of a secreted polypeptide 
is defined functionally as that portion of a secreted polypeptide which when combined with the 
other components of the fusion polypeptide defined herein results in increased secretion of the 
desired polypeptide as compared to the level of desired polypeptide secreted when an 
expression vector is used which does not utilize the secreted polypeptide. Generally, such 
portions of the secreted polypeptide comprise greater than 50% of the secreted polypeptide. 

Specific expression hosts include A. nidulans (Yelton, M. et al. (1984) Proc. Natl. Acad. Sci. 
USA 81:1470-1474; Mullaney, E.J. et al. (1985) Mol. Gen. Genet 199:37-45; John, M.A. and 
Peberdy. J.F. (1984) Fn ^m* Microb. Technol. 6:386-389; Tilbum et al. (1982) Gene 26:205- 
221; Ballance. D.J. et al. (1983) Rirv^m Bioohvs. Res. Comm. 112:284-289; Johnston, I.L. et ■ 
al. (1985) EMBO 4:1307-1311); A. niger (Kelly, J.M. and Hynes M. (1985) EMBO 4:475-479); A. 
awamori, e.g., NRRL 3112. ATCC 22342, ATCC 44733, ATCC 14331 and strain UVK 143f; A. 
oryzae, e.g.. ATCC 1 1490; N. crassa (Case, M.E. et al. (1979) Pror. Natt. Acad. Sci. USA 
76:5259-5263; Lambowitz US Patent No. 4,486,553; Kinsey, J A. and Rambosek. J.A. (1984) 
K/. rt .^. la r a nH Cellular Biology 4:117-122; Bull, J.H. and Wooton. J.C. (1984) Nature 310:701- 
704); Trichoderma reesei, e.g. NRRL 15709. ATCC 13631. 56764, 56765, 56466. 56767 and 
Trichoderma viride, e.g.. ATCC 32098 and 32086. A preferred expression host is A. n/gervar. 
awamori in which the gene encoding the major secreted aspartyl protease has been deleted. 
The production of this preferred expression host is described in USSN 214.237 filed July 1. 
1988, expressly incorporated herein by reference. 

As used herein, a "promoter sequence" is a DNA sequence which is recognized by the 
particular Aspergillus species for expression purposes. It is operably linked to a DNA sequence 
encoding the above defined fusion polypeptide. Such linkage comprises positioning of the 
promoter with respect to the translation initiation codon of the DNA sequence encoding the 
fusion DNA sequence. The promoter sequence contains transcription and translation control 
sequences which mediate the expression of the fusion DNA sequence. Examples include the 
promoter from the A. awamori or A. n/gerglucoamylase genes (Nunberg, J.H. et al. (1984) MoK 
Cell. Biol. 4:2306-2315; Boel. E. etal. (1984) EMBO 3:1581-1585). the Mucor m/ehe; carboxyl 
protease gene (EPO Patent 0 215 594). the Trichoderma reese/cellobiohydrolase I gene 
(Shoemaker, S.P. et al. (1984) European Patent Application No. EP 0 137 280 A1), the A. 
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nidulans trpC g ne (Yelton, M. et al. (1984) Pr QC Natl. Acad. Sci. USA 81 :1470-1474; Mullaney. 
E.J. et al. (1985) Mol Gen. G net 199:37-45), the A. nidulans alcA gene (Lockington, R.A. et 
al. (1986) Gene 33:137-149), the A. nidulans tpiA g ne (McKnight, G.L et al. (1986) Cejl 
46:143-147), the A. nidulans amdS gene (Hynes, M.J. et al. (1983) Mol. Cell Biol. 3:1430-1439), 
and higher eukaryotic promoters such as the SV40 early promoter (Barclay, S.L and Meller, E. 
(1983) Molecular and Cellular Biology 3:21 17-2130). 

Likewise a "terminator sequence" is a DNA sequence which is recognized by the expression 
host to terminate transcription, it is operably linked to the 3' end of the fusion DNA encoding the 
fusion polypeptide to be expressed. Examples include the terminator from the A. nidulans trpC 
gene (Yelton, M. et al. (1984) Proc. Natl. Acad. Sci. USA 81:1470-1474; Mullaney, EJ. et al. 
(1985) Mol Gen. Genet. 199:37-45), the A. awamori or A. niger glucoamylase genes (Nunberg, 
J.K et al. (1984) Mol. Cell. Biol. 4:2306-253; Boel, E. et al. (1984) EM BO 3:1581-1585), and the 
Mucor /7J/efte/ carboxyl protease gene (EPO Patent No. 0 215 594), although any fungal 
terminator is likely to be functional in the present invention. 

A "polyadenylation sequence" is a DNA sequence which when transcribed is recognized by the 
expression host to add poiyadenosine residues to transcribed mRNA. It is operably linked to the 
3' end of the fusion DNA encoding the fusion polypeptide to be expressed. Examples include 
polyadenylation sequences from the A. nidulans trpC gene (Yelton, M. et al. (1984) Proc. Natl. 
Acad. Sci. USA 81:1470-1474; Mullaney, E.J. et al. (1985) Mol. Gen. Genet 199:37-45), the A. 
awamoriOTA. n/ger glucoamylase genes (Nunberg, J.H.etal. (1984) Mol. Cell. Biol. 4:2306- 
2315; Boel, E. et al. (1984) EMBO 3:1581-1585), and the Mucor m/eAre/ carboxyl protease gene 
(EP Patent 0 215,594). Any fungal polyadenylation sequence, however, is likely to be 
functional in the present invention. 

Example? , 

Example 1 
Construction of pGAST and pGASTK 

These plasmids were made to express the truncated form of alpha 2,3 sialyltransferase as 
a fusion of the entire glucoamylase coding region which included the catalytic core, the 
linker region and the starch binding domain in Aspergillus niger y/ar. awamori. The 
truncated form of sialyltransferase begins with leucine, the thirtieth amino acid from the 
initial methionine of the mature protein sequence. A BamHl/H/ndlll fragment of rat liver 
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alpha 2,3 sialyltransferas which included all of the structural gene and some 5' and 3' 
flanking ends was subclon d into pUC219. The original clone, pGIR23ST (Ichikawa et al. 
(1992) .1 Am. Ch m. Soc. 114:9283-9298), contained the full length ct-2,3 sialyltransferas e 
gene. A BamHl fragment of the gene was provided by Cytel Corp. (San Diego, CA, USA). 
Single stranded template was made of the resulting plasmid, pUC219:ST. Oligonucleotides 
were used to introduce a unique EcoRI restriction site at the 5* region of sialyltransferase 
and a unique Xba\ site at the 3* end of the gene. The EcoRI site was located at amino acid 
number 38 and 39 of the mature protein sequence and the oligonucleotide used was 5' 
ATGGGAAGACTCGAATTCACTGATT 3* (Seq ID No. 1). An XZ>al site was introduced 17 
bases after the stop codon of ST using the oligonucleotide 5' 

GCCGGACACTGGTCTAGAGAGGCTCAC 3' (Seq ID No. 2). Site directed mutagenesis as 
described by Maniatis et al. (in Molecular Cloning, A Laboratory Manual, 1980, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, New York) was used to introduce these sites into 
the single stranded template of pUC219:23ST. The correct clones of pUC219:ST R1/Xba 
were identified by restriction analysis and verified by DNA sequencing. This plasmid was 
subsequently digested with EcoRI and Xi>al and the 1.0 kb sialyltransferase fragment was 
isolated for ligations. 

Linker oligonucleotides were designed to: 1) connect the end of the giucoamylase coding 
region from a Sa/i site, 31 base pairs from the 3' end of the coding region, to the truncated 
version of sialyltransferase; 2) include or not include a KEX2 protease cleavage site 
between the structural genes; and 3) reconstruct the final part of the glaA coding region 
from the Sa/i site to the end of glaA coding region (minus the stop codon), as well as 
reconstruct the sialyltransferase gene from leucine codon #30 to the introduced EcoRI at 
codons 38 and 39. The KEX2 cleavage site which is recognized by the KEX2 protease is 
made up of two amino acids, lysine and arginine being the favored pair for cleavage. The 
pair of complimentary oligonucleotides used with the KEX2 protease cleavage site were 5' 
TCG ACC GCG ACG GTG ACC GAC ACC TGG CGG AAG CGC CTA CAC TTA CTC CAA 
TGG GAA GAC TCG 3* (Seq ID No. 3) and 5' A ATT CGA GTC TTC CCA TTG GAG TAA 
GTG TAG GCG CTT CCG CCA GGT GTC GGT CAC CGT CGC GG 3' (Seq ID No. 4). 
The pair of complimentary oligonucleotides used to combine the two genes without the 
KEX2 protease cleavage site were as follows: 5* TCG ACC GCG ACG GTG ACC GAC 
ACC TGG CGG CTA CAC TTA CTC CAA TGG GAA GAC TCG 3' (Seq ID No. 5) and 5'A 
ATT CGA GTC TTC CCA TTG GAG TAA GTG TAG CCG CCA GGT GTC GGT CAC CGT 
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CGC GG 3* (S q ID No. 6). Both sets of oligonucleotid link rs w r anneal d and 
phosphorylated b for ligation into th v ctor; pSL1180 (Pharmacia LKB Biotech, 
Piscataway, NJ, USA) digested with Sa/I and EcoRI. Several clones of the resulting 
plasmids, pSL1180:Sal-EcoR1 linker and P SL1180:Sal-EcoR1 kex2 linker, were sequenced 
and correct clones were identified. The correct clones of each were then digested with the 
restriction enzymes Sa/I and EcoRI and the linkers of both types were isolated for ligations. 

The vector, p:GAMpR was used to isolate a portion of the glucoamylase promoter, all of the 
glucoamylase terminator pBR322, and the Aspergillus niger var. awamori pyrG selectable 
marker by digesting it with the restriction enzymes Mlu\ and Xba\ t yielding a 9.6 kb and a 
3.7 kb fragment. The 9.6 kb fragment was isolated for ligation. p:GAMpR was previously 
described in US Patent 5,364,770. Briefly, this plasmid contains the Aspergillus niger var. 
awamori UVK143f glucoamylase promoter, glucoamylase coding region fused to bovine 
chymosin coding region, glucoamylase terminator, and the pyrG selectable marker and 
bovine chymosin in a pBR322 backbone. The 9.6 kb MuVXba\ fragment contained 1441 
bases of the 5* portion of the glucoamylase promoter, the glucoamylase terminator, pyrG, 
and pBR322. 

The vector pUC: AaGAM contained the Aspergillus niger var. awamori UVK1 43f 
glucoamylase promoter, glucoamylase structural gene, and glucoamylase terminator in a 
pUC vector. This was digested with restriction enzymes Mlu\ and Sa/l to yield a 6.3 and a 
2.7 kb fragment The 2.7 kb fragment, which contained 526 bases of the 3' end of the 
glucoamylase promoter and the whole coding region except for the final base pair, was 
isolated for ligation. 

A four way ligation took place with all of the above isolated fragments: Mlu/Xba of 
p-.GAMpR, the MluVSal\ of P UC:AaGAM, the EcoRi/Xdal of pUC219:23ST Rl/Xbal, and 
either the Sa/-EcoRI linker or the Sa/-EcoRI kex2 linker. The ligation produced several 
correct clones which were confirmed by restriction analysis. These plasmids were named 
pGAST (no KEX2 protease cleavage site) and pGASTK (with the KEX2 protease cleavage 
site). 



The resulting plasmids were purified on a cesium chloride (CsCl) gradient in preparation 
transformation into Aspergillus niger var. awamori. 
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Exampl 2 

Construction of pGADVST and pGADVSTK 
These plasmids were made to express the truncated form of alpha 2,3 sialyltransferase as 
a glucoamylase fusion in Aspergillus niger var. awamori similar to the constructs described 
in Example 1 . However, in these constructs, the last 120 codons of the glucoamylase 
starch binding domain were omitted. Two approaches were taken to express 
sialyltransferase as a glucoamylase fusion. The first approach involved fusing the 
glucoamylase coding region from the Nhe\ site which included all of the catalytic core and 
most of the glucoamylase linker region through amino acid #496 (alanine) to the 
sialyltransferase coding region from leucine #30. The second approach was nearly 
identical to the first approach with the addition of a KEX2 cleavage site in between the two 
coding regions. The following oligonucleotide pairs were used to fuse the two coding 
regions: 5' C TAGCAAAGCTACACTTACTCCAATGGGAAGACTCG 3' (Seq ID No. 7) and 
5' AATTCGAGTCTTACCATTGGAGTAAGTGTAGCTTG 3* (Seq ID No. 8) for the construct 
without the KEX2 protease cleavage site and 5' 

CTAGCAAGCGCCTACACTTACTCCAATGGGAAGACTCG 3' (Seq ID No. 9) and 5' 
AATTCGAGTCTTACCATTGGAGTAAGTGTAGGCGCTTG 3' (Seq ID No. 10) for the 
construct which included a KEX2 protease cleavage site in between the coding regions. 
The primer pairs were annealed by adding equal volumes of the primers into a tube and 
placing the tubes in a 100*C heat block for 2 minutes. The tubes were then slowly cooled 
to room temperature. The annealed primers were then phosphorylated as described by 
Maniatis et al. (in Molecular Cloning, A Laboratory Manual, 1980, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York). The vector pSL1180 (Pharmacia LKB Biotech) 
was digested with Nhe\ and EcoRl at 37°C for 1 hour and dephosphorylated with calf 
alkaline phosphotase (Boehringer Mannheim) for 1 hour, again at 37°C. Both sets of the 
annealed and phosphorylated linkers were ligated to the pSL1180 NheVEcoRl digested 
vector in two separate ligations and transformed into E. coll DNA was isolated from E. coli 
colonies obtained from these ligations and restriction endonuclease digestion identified 
potential correct clones of the NheVEcoRl linkers were obtained and after DNA sequence 
analysis, correct linkers were identified. The plasmids containing the correct linkers were 
digested with Nhe\ and EcoRl and subjected to electrophoresis on a 4% GTG NuSieve 
agarose gel (FMC Corporation, Rockland, Maine). After electroelution of the DNA bands 
containing the correct linkers from agarose, ethanol precipitation, and resuspension into TE 
buffer (lOmMTris H-CI. pH 7.6, 1 mM EDTA), the linkers were now ready for use. 
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An MluVNhe\ digest was performed on pGAMpRpyr (describ d above) and the 2.3 kb 
fragment that included 526 bases of the glaA promoter and the glucoamylase coding r gion 
up to codon 496 was isolated on a 1% agarose gel and eluted. An £coRI/Ncol digest of 
pUC219:23ST (described above) produced an 800 base pair sialyltransferase fragment that 
was missing the last 202 bases of the coding region. This fragment was isolated. 

P SL1180 (Pharmacia LKB Biotech) was digested with M/ul/Wcol, dephosphorylated, gel 
isolated and electrocuted. With each of the above isolated fragments and linkers, two 
four-way ligations were done. One set involved the linker without the KEX2 protease 
cleavage site and the other set involved the linker with this cleavage site. The resulting two 
plasmids, pSLGADVST Part 1 and pSLGADVSTK Part 1, were confirmed to be correct by 
restriction analysis. Both plasmids were then digested with Nhe\ and AvrU and an 80 base 
pair fragment was isolated from each plasmid which included the linker and 32 bases 3 1 
from the EcoRI site of sialyltransferase. An A/hel/Awll digest was done on 
pSL:GASTpyrGV (refer to Figure 3C) to yield the large 1 1 .3 kb vector fragment. 
pSL:GASTpyrGV was the result of subcloning a Spel and C/al fragment from pGAST into 
PSL1180. The SpeUC/al fragment contained approximately 800 bases of the 3' end of the 
promoter, the entire glucoamylase coding region, the truncated sialyltransferase gene and 
the glaA terminator. The resulting plasmid pSLGAST was then digested with EcoRV, a 
unique site found 250 bases from the 5' end of the terminator. A SmaUScal fragment 
containing the A. nigerpyrG selectable marker was ligated into the pSLGAST EcoRV 
digested vector resulting in pSLGASTpyrGV. 

Two separate ligations were performed with both 80 base pair linkers from P SL:GADVST 
Part 1 and pSL: GADVSTK Part 1 into the NheVAvM digested pSLGASTpyrGV. The 
resulting plasmids, after confirmation by restriction analysis, were named pSL:GADVST 
Part 2 and pSLGADVSTK Part 2, and each contained the glucoamylase promoter from the 
MliA site, the glucoamylase coding region without the starch binding domain, the linker with 
(pSL:GADVSTK) and without the KEX2 (pSL.GADVST) cleavage site, the truncated 
sialyltransferase gene and the glaA terminator. Both pSLGADVST Part 2 and 
pSLGADVSTK Part 2 were digested with MliA and a 3.3 kb fragment was isolated 
containing glucoamylase promoter and coding region without the starch binding domain, 
and sialyltransferase coding region minus the last 50 bases. Finally. pGASTK (described 
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abov ) was dig sted with MM and the 7.0 kb fragment which contained th r st of the glaA 
promoter of 526 bas s. the pyrG selectable marker, and th last 50 bases of 
sialyltransferase was isolated. Both 3.3 kb MM fragments of pSL:GADVST Part 2 and 
pSLGADVSTK Part 2 were ligated into the 7.0 kb MM fragment isolated from pGASTK. 
The final ligation yielded pGADVST and pGADVSTK. Both expression vectors were then 
grown for large scale isolation of the plasmid in order to obtain mass quantities of purified 
plasmid for fungal transformations. 

Example 3 
Fungal Transformations 
CsCI gradient purified plasmid DNA was used to transform Aspergillus niger var. awamori 
dgr246 P2. dgr246 P2, described by Ward, M. et al. (1993) Appl Microbiol. Biotech. 
39:738-743, is a strain derived from UVK143f that has been deleted of its major aspartic 
proteinase Jaspergillopepsin) gene. Briefly, this strain had undergone several rounds of 
mutagenesis selection for improved expression of a heterologous gene product. The 
transformation protocol was a modification of the Campbell method (Campbell et al. (1989). 
Curr. Genet. 16:53-56). All solutions and media were either autoclaved or filter sterilized 
through a 0.2 micron filter. Spores of dgr246 P2 were harvested off of complex media agar 
(CMA) plates. CMA contained 20 g/L dextrose, 20 g/L DifcoBrand malt extract, 1 g/L Bacto 
Peptone. 20 g/L Bacto agar, 20 ml/L of 100 mg/ml arginine, 20 ml/L of 100 mg/ml uridine, 1 
ml/L of 10 mg/ml PABA, 10 ml/L of met/bio solution, and 1 ml/L of 50 mg/ml streptomycin. 
Met/bio solution was composed of 50 g of L-methionine. 200 mg of D-biotin and distilled 
water up to 1 liter. An agar plug of approximately 1.5 cm square bearing sporulating 
mycelium was used to inoculate 100 mis of yeast extract broth with glucose (YEG) (5 g/L 
yeast extract. 20 g/L glucose, 20 ml/L 100 mg/ml arginine. 20 ml/L 100 ml/ml uridine, 1 ml/L 
50 mg/ml streptomycin). The flask was incubated at 37'C on a shaker at 250-275 rpm, 
overnight The mycelia were harvested through sterile Miracloth (Calbiochem. San Diego, 
CA. USA) in a funnel and washed with 200 mis of solution A (0.8 MgS04 in 10 mM sodium 
phosphate. pH 5.8). The washed mycelia were placed in a sterile solution of 100 mg of 
Novozym234 (Novo Nordisk Industri A/S. Baagsvad, Denmark) in 20 mis of solution A. 
This was incubated at 28'C at 200 rpm for 1 hour in a sterile 250 ml bottle (Coming Inc.. 
Coming, NY. USA). After incubation, the protoplasting solution was filtered through sterile 
Miracloth filters into a sterile 50 ml conical tube (Sarstedt. USA). The resulting liquid 
containing protoplasts was divided equally amongst four 50 ml conical tubes. Forty mis of 
solution B (1.2 M sorbitol, 50 mM CaCI 2 , 10 mM Tris. pH 7.5) were added to each tube and 
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centrifuged in a table top clinical c ntrifuge (Damon IEC HN Sll centrifug ) at 3/4 sp ed for 
10 minutes Th supernatant from each tube was discarded and 20 mis of fresh solution B 
was added to one tube, vortexed. then poured into the next tube until all the pell ts w re 
resuspended. The tube was then centrifuged at 3/4 speed for 10 minutes. The 
supernatant was discarded. 20 mis of fresh solution B was added, the tube was centrifuged 
for 10 minutes at 3/4 speed. The wash occurred one last time before resuspending the 
wash protoplasts into solution B, at 100 ul/0.5-1.0 X 10* protoplasts. To each 100 ul of 
protoplasts in a sterile 15 ml conical (Sarstedt. USA). 10 ul of the transforming plasm.d 
DNA was added. To this, 12.5 ul of solution C (50% PEG 4000. 50 mM CaCb. 10 mM Tns. 
pH 7.5) was added to the 15 ml tube and placed on ice for 20 minutes. One ml of solution 
C was added to the tube, removed from the ice to room temperature and mixed gently. 
Two mis of solution B were added immediately to dilute solution C. The transforming mix 
was added equally to 3 tubes of melted MMS overlay (6 g/L NaN0 3 . 0.52 g/L KCI. 1.52 g/L 
KH 2 PO,, 218.5 g/L D-sorbitol. 1.0 ml/L trace elements-LW, 10 g/L SeaPlaque agarose 
(FMC Byproducts. Rockland. Maine. USA) 20 ml/L 50% glucose. 2.5 ml/L 20% 
MgS0 4 *7H 2 0, 10 ml/L met/bio solution. 2.0 ml/L 10 mg/ml PABA pH to 6.5 with NaOH) that 
were stored in a 37°C water bath. Trace elements LW consisted of 1 g/L FeSCV7H 2 0, 8.8 
g/L ZnSO/7 H 2 0. 0.4 g/L CuS0 4 *5 H 2 0. 0.15 g/L MnS0 4 *4 H 2 O.0.1 g Na 2 B<O 7 *10 H 2 0. 50 
mg/L (NH«) 8 Mo^M H 2 0. 250 mis H 2 G. 200 ul/L concentrated HCI. The melted overlays 
with the transformation mix were immediately plated onto 3 MMS plates (same as MMS 
overlay recipe with the exception of 20 g/L of Bacto agar instead of 10 g/L of SeaPlaque) 
that had been supplemented with 200 ul/plate of 100 mg/ml of arginine added directly on 
top of the agar plate. After the agar solidified, the plates were incubated at 37'C until 
transformants grew. 

The sporulating transformants were picked off with a sterile toothpick onto a plate of 
minimal media ♦ glucose (MM). MM consisted of 6 g/L NaN03, 0.52 g/L KCI. 1.52 g/L 
KH2P04. 1 ml/L Trace elements LW. 20 g/L Bacto agar. pH to 6.5 with NaOH. 25 ml/L 40 
% glucose 2.5 ml/L 20% MgSo4*7H20. 1 ml/L 50 mg/ml streptomycin plus 10 mg/ml 
carbenicillin. 20 ml/L 100 mg/ml arginine. Once the transformants grew on MM, they were 
transferred to a CMA plate, one transformant per plate of CMA. 
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Example 4 

Screening f Sialyltransf rase Transformants 
A 1 5 cm square agar plug of each transformant was grown in 50 mis in a 250 ml shake 
flask of inoculum built-up medium, CSL + 5% xylose (100 g/L com steep liquor (50% 
solids), National). 1 g/L NaH 2 P0/H 2 O, 0.5 g/L MgSO,. 50 g/L xylose, 3 g/L mazu (Mazur 
Chemicals, USA), P H to 5.8 with NaOH, 1 ml/L 50 mg/ml streptomycin plus 10 mg/ml 
carbenicillin. Transformants and the parent control dgr246 P2 were incubated at 25»C, 160 
rpm for 2 days. Five mis of the 2 day old medium were inoculated into 50 mis of Clofine 
speciai (70 g/L sodium citrate, 15 g/L (NH,) 2 SO,. 1 g/L NaH 2 PO,-H 2 0. 1 g/L MgSO,. 10 
ml 10% Tween 80, 2 ml/L mazu. 60 g/L spray dried tofu TF30 soy milk powder (Armour 
Good Ingredients, Springfield. Kentucky. USA), 12% final concentration of maltose. 20 
ml/L. 100 mg/ml arginine. 20 ml/L 100 mg/ml uridine. 10 ml/L met/bio solution, 1 mg/L 
streptomycin plus 10 mg/ml carbenicillin) in a 250 ml shake flask. The Clofine Special 
flasks were incubated at 25'C, 160 rpm for up to 7 days. Assays were performed on the 
supematants of the growing cultures to determine the best producers of the enzyme of 
interest. The sialyltransferase assays were performed as described below. 

Sialvltransferase Assay 

The sialyltransferase assay was used to measure the enzymatic activity of alpha 2.3 
sialyltransferase that catalyzed the following reaction: 

CMP-sialic acid " CMP + 

+ Lacto-N-Tetraose sialyl lacto-N-tetraose 

The sialylated product was separated from unsialylated material on a column of ion 
exchange resin. Use of [Hcj-labeled substrate allowed measurement of the sialylated 
product by liquid scintillation counting. The resin used in the columns (Poly-prep 
chromatography columns 0.8 x 4 cm. Bio-Rad, Hercules. CA, USA) was AG1-X8. 20(M00 
mesh, chloride form (Bio-Rad). The resin was prepared by measuring out 100 g of the 
resin and then adding 400 mis of 1 M NaOH. The liquid was decanted and washed with 3 
liters of distilled water. After the water was decanted. 400 mis of 1 M phosphoric acid was 
added stirred and the resin was allowed to settle. The phosphoric acid was then decanted 
and the resin was washed with 3 liters of distilled water. After the waterwas decanted. 400 
mis of 0.1 M sodium phosphate buffer. pH 6.8, was added and the resin and buffer slurry 
were further adjusted to P H 6.8. This was then washed a final time with 3 liters of distlled 
water and a final 50% slurry was made with MilliQ water. The slurry of prepared res.n was 
swirled and 1 ml of the slurry was pipetted into a Bio-Rad column and allowed to settle. 
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The assay mix was mad up of 0.5 mM cytidine 5' monophosphat sialic acid (CMP-SA) 
(Boehring r Mannheim, Indianapolis, IN, USA), 12.3 mM sodium cacodylate buffer, pH 6.0 
(Sigma, St. Louis, MO t USA), 1.25% Triton CF-54 (Sigma), and 1.8 mg/ml bovine serum 
albumin (Sigma). The acceptor molecule, lacto-N-tetraose (LNT) (Sigma) was made to a 7 
mM final concentration. The column buffer used to wash the resin was 5 mM sodium 
phosphate buffer, pH 6.8. The concentration of the assay mix was measured after 
dissolving the dry NEN-619 cytidine 5' monophosphate sialic acid (CMP-[1 4 C]-SA ) (NEN 
Research Products, Dupont, USA) with 0.65 ml of the non-radioactive 5.76 mM CMP-SA. A 
1:100 dilution of this was measured at OD270 against a dH20 blank. The CMP-SA 
concentration was calculated as follows: 

[CMP-SA concentration (Molar) = {OD270)(1 mol/8600)(dilution factor)] 

The assay mix was diluted so that when 25 ul of reaction mix were added to each sample, 
each sample received the equivalent of 3.4 ul of the above mixture. To the assay mixture, 
960 ul 1 M NaCacodylate, 480 ul 20% Triton CF-54 and 290 ul 50 mg/ml BSA (filter 
sterilized) was added. Three hundred ul aliquots of this final mixture were stored at -20°C. 
The reaction mix was prepared by adding 250 ul of assay mix, 70 ul of 7 mM LNT, 12 ul of 
1 M MnCI and 168 ul dH z O to an Eppendorf tube. 

For each sample to be tested, 25 ul of reaction mix were added for each 14 ul of sample 
(supernatant from spent broth, standards and a buffer blank) to an Eppendorf tube. The 
tubes were spun down momentarily in a microcentrifuge and incubated in a 37°C water 
bath for 10 minutes. The standard alpha 2,3 sialyltransferase was provided by Cytel 
Corporation (San Diego, CA, USA) at 1 unit/ml. From this, dilutions using the sodium 
phosphate buffer were made for a range of 20-140 milii units, to generate a standard curve. 
Dilutions of the spent supematants were also made in the sodium phosphate buffer. After 
incubation, 1 ml of column buffer was added to each tube and each sample was passed 
over an equilibrated column of resin. The unbound product was collected into a scintillation 
tube, washed with another 1 ml of column buffer, collected again and pooled. To each 
tube! 5 mis of aqueous scintillation fluid (ICN Ecolume, Costa Mesa, CA, USA) were added, 
shaken vigorously and counted in a liquid scintillation counter (TM Analytical) for 1 minute. 
The formulae required for the calculations of sialyltransferase activity units were as follows: 
a) Total nmol CMP-SA in reaction = CMP-SA X 3.4 ul 

where CMP-SA concentration = molar concentration as determined above; 



PCT/US96/07171 

- 26 - 

cpm/nmol = total reaction cpm/nmol CMP-SA 

where the total reaction cpm = total counts added to th reaction, corrected 
for blank and nmol CMP-SA = total nmol CMP-SA in the reaction as 
calculated in a); and finally 
Activity (units/ml) = u mo i/m in/ml = 

[(sample cpm)/(cpm/nmol)](1/10min)(dilution factor)(1/0.014ml)(1.6)(1 
umol/1000 nmol) 

sample cpm = average cpm of sample corrected for average blank cpm 
10 minutes = reaction incubation time 
0.005 = volume of enzyme dilution tested 

1.6 = unit definition was specified at saturating substrate concentration. 
Since the standard assay contained a subsaturating level of LNT, activity 
was multiplied by a factor of 1.6 to reflect maximal velocity by analysis of 
kinetics. 

Example 5 
Screening of ST Transformants 
Following the transformation methods detailed in Example 3 t twenty transformants with the 
pGAST expression vector, 22 with the pGASTK expression vector, 9 with the pGADVST 
expression vector, and 12 with the pGADVSTK expression vector were screened in shake 
flasks. Using the sialyitransferase activity assay of Example 4, the following transformant 
exhibited the best sialyitransferase levels secreted by transformants with each expression 
vector for pGAST transformant #20, 270 units/liter, for pGASTK transformant #4, 210 
units/liter, for pGADVST#6, 120 units/liter and for pGADVSTK #1, 543 units/liter. Media 
optimization work as described in Example 6 was done with pGADVSTK#1 (DVK1). 

Example 6 

Media and Process Development for the Production 
of Sialyitransferase in Aspergillus 

a. Inoculum Medium 

The medium used for the generation of cell mass was composed of: 100 g/L corn steep 
liquor; 0.5g/L magnesium sulfate; 1 g/L monobasic sodium phosphate monohydrate; 3 ml/L 
Mazu; 50 g/L xylose. The medium pH was adjusted to 5.8. The inoculum medium was 
inoculated with the desired culture and incubated at 25°C for 48 hr at 250 rpm. After th 
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completion of the incubation time a given volum would b transf rred to the production 
stag . 

b. Production Medium 

The initial medium used for the media development work was a liquid production medium, 
"Clofine Special." Clofine Special contained 60 g/L spray dried soy milk; 15 g/L ammonium 
sulfate; 70 g/L sodium citrate; 1 g/L magnesium sulfate; 1 ml/L Tween 80; 1 ml/L Mazu; 0.5 
g/L potassium chloride; 1 g/L monobasic sodium phosphate monohydrate; 2 g/L arginine; 2 
g/L uridine; and 120 g/L maltose. The pH of the medium was adjusted to 6.2. The 
production media was inoculated with 5 mis of the desired culture, previously grown in the 
inoculum media. The production media was incubated the same as the inoculum media, 
except for a longer period of time. 



X-Stat, a statistical design software, was used to conduct a number of experiments directed 
at the optimization of the production of glycosyltransf erase, and particularly ST. After 
several experiments using X-Stat a new media evolved. The new medium contained: 
KH 2 P0 4 . 1 g/L; MgSCU, 0.8 g/L; KCI, 1 g/L; arginine, 4 g/L; Tween 80, 1 ml/L; Mazu, 13 
ml/L; spray dried soy milk, 60 g/L; maltose, 100 g/L; glucose, 10 g/L; ammonium sulfate, 
13 g/L; sodium citrate, 60 g/L. 

It was determined that the ST was aggregating to the spray dried soy milk in the media. 
Experiments were designed to look for an alternative complex nitrogen source for the 
medium. The new medium recipe described above was used to examine different nitrogen 
sources by substituting spray dried soy milk with wheat flour, Primetone CLT, Primetone 
SG, com gluten and Sheftone. DVK1 was grown in media containing the various nitrogen 
sources and it was determined that Sheftone N provided a good source of complex 
nitrogen, and additionally added stability to the ST produced in the medium. Thus, this 
Sheftone N-containing medium had the advantage of improving the expression of the ST, 
as well as provided advantages during the recovery process of the protein (less 
degradation during recovery). Additional experiments followed this finding. Experiments 
were set up to study different concentrations of Sheftone N in the media. Results indicated 
that various concentrations of Sheftone N resulted in good expression, while 
concentrations from about 40 g/L up to about 100 g/L (preferably £80 g/L of Sheftone N) 
resulted in increased production of ST. The data are shown in Figure 4. 
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Additionally, during media optimization it was determined that the pH of the medium had a 
very important effect on th production and stability of the ST produced. Experiments 
studying pH were conducted. Results indicated that by increasing the pH of the medium 
over about pH 6 up to 8 t increased levels of ST were produced. See Figure 5, 

c. Buffer Concentration Studies 

Based on the results shown in Figure 5, which indicated the pH of the medium had an 
effect on the production of ST, and knowing that the desired pH range for the medium was 
neutral, an experiment was conducted to study the buffer capacity of the medium. Sodium 
citrate which was present in the medium at an initial concentration of 60 g/L (see above) 
was increased up to at least 120 g/L. The results, as shown in Figure 6, indicated that 
stronger buffering capacity in the medium gave higher production of ST. 

d. Process Development/14 L Fermentation Experiments 

In order to scale up the results described above, Sheftone N-containing media (without th 
spray dried soy milk) at a concentration of 100 g/L Sheftone N and 60 g/L sodium citrate 
was used. The pH was adjusted to 6.5 and controlled at 5.8 with ammonia hydroxide. The 
fermenter produced 988 units/ml of ST. The yield from the fermentation came very close to 
the theoretical results based on the work presented here of 1057 units/ml. This represents 
a 93.5% duplication efficiency. 

Example 7 
Construction of pGAKHi+ 
The Expression Vector Used to Express Fucosy {transferase 
An expression plasmid was constructed using methods known in the art, which allows DNA 
sequences encoding desired polypeptides to be fused to the C-terminus of the catalytic 
core and part of the linker region of Aspergillus glucoamylase. 

The plasmid pGAKHi+ was constructed based on the bacterial plasmid pSL1180 
(Pharmacia LKB). This plasmid is similar to the pUC series of phagemids such as pUC118 
but has an extended multiple cloning site (MCS) containing all possible six base pair 
palindromes including two recognition sites for C/al, one of which is methylated and the 
other not methylated by E. coli dam methylase. An expression cassette was inserted into 
the MCS of pSL1 180 so that the first 28 bp of the MCS from H/ndlll to Spel was present at 
one nd. Beginning at the Spel recognition site, the expression cassette contains the 
following components: the promoter and 5' flanking region of the glaA gene of A. niger var. 
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awamori strain UVK143f (US Patent 5,364,770) starting at a Spe\ sit approximat ly 1 kb 
upstream from the translation start codon; the coding region of the gfaA gen from strain 
UVK 143f from the start codon to an Nhe\ site, the six bp which encode amino acids 497 
and 498 in the linker region of mature glucoamylase; a synthetic piece of double stranded 
DNA with Nhe\ and BstEW ends; approximately 250 bp of the terminator region of the glaA 
gene from A. niger strain #7 (US Patent 5,364,770) beginning at a BstEW site 20 bp 5' of 
the translation stop codon and ending at an EcoRV site; an approximately 2.4 kb fragment 
containing the A. niger pyrG gene (Wilson et al., 1988) obtained as a Smal to Seal 
fragment from pBH2 (Ward et al., 1989) ( the Sma\ site exists within the pUC18 MCS of 
pBH2 and the Seal site exists 65 bp from the end of the BamHr-H/ndlll fragment of A. niger 
DNA present in pBH2); approximately 1.3 kb of 3' flanking DNA from the giaA from A. niger, 
beginning at the EcoRV site 250 bp 3' of the translation stop codon and ending at a C/al 
site. After the C/al site there exists 75 bp of the pSL1180 MCS from C/al (the site not 
methylated by E coli dam methylase) to Hin6\\\, followed by bp of pBR322 from Hin6\\\ to 
C/al, followed by 198 bp of the pSL1180 MCS from C/al (the site methylated by E. colt dam 
methylase) to EcoRI. 

As shown in Figure 7 this expression vector (pGAKHi+) contains unique Nhe\ and BstEW 
sites which can be utilized to insert a DNA fragment encoding a desired polypeptide such 
that a fusion protein (comprising the glucoamylase catalytic core, part of the glucoamylase 
linker region and the desired polypeptide) would be expected to be produced after 
transformation into Aspergillus strains. The pyrG gene was included as a selectable marker 
for transformation into Aspergillus strains which are pyrG null mutants. 

Example 8 

Construction of pGAKHi-FT 

Source of the FT cDNA 

The human a1,3 fucosy (transferase V (FT) cDNA was obtained from Dr. John Lowe at the 
University of Michigan. The clone pcDNA1-FT contains the full-length FT sequence in 
pcDNAI vector (Weston, B.W.; Nair, R.P.; Lowe, J.B. (1991) J. Biol. Chem. 267:4152-60). 
The full-length clone encompasses an amino-terminal cytoplasmic tail, a transmembrane 
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signal anchor, a stem r gion, and a carboxy-t rminal catalytic domain as typical of 
glycosyltransf ras s (Paulson, J.C.; Coll y, K.J. (1989) J. Biol. Ch m. 264:17615-17618.) 

Constnjction of pGAKHi-FT Plasmid 

The construct was designed so that the A niger glucoamylase gene would be fused in 
frame to the catalytic domain of the FT gene. The FT gene was amplified using native Pfu 
polymerase (Stratagene) and the polymerase chain reaction (PCR) using Buffer #1 and 
conditions supplied by the manufacturer. The primers generate an Nhel restriction site at 
the 5' terminus and a BstEit site at the 3' terminus. The 5' primer was designed to 
hybridize to the stem portion of the FT gene, which fuses in-frame to the A. niger 
glucoamylase sequence of pGAKHi+ (Example 8), with a kex2 proteolytic cleavage site 
between the glucoamylase and FT genes. The 3' primer was designed to amplify the 3' 
end of the FT coding region including the stop codon. The 5' and 3' primers were 5 -CTA- 
GCT-AGC-AAG-CGC-GGA-TCC-CCT-AGG-CCA-3' (Seq ID No. 11) and 5-CCC-GGT- 
GAC-CTOAGG-TGA-ACC-AAG-CC-3' (Seq ID No. 12), respectively. The PCR procedure 
amplified the stem region and catalytic domain of the FT gene, yielding a 1.2 kb fragment. 

The amplified FT gene was sequentially digested with the enzymes Nhe I and BstEII (New 
England Biolabs), using conditions provided by the manufacturer, and fractionated on an 
agarose gel. The 1.2 kb fragment was isolated using a Prep-A-Gene kit (BioRad) and 
ligated under standard conditions into the unique Nhel/BstEII sites of plasmid vector 
pGAKHi+. This fused the stem region of the FT gene in-frame with the glucoamylase 
sequence, generating pGAKHi-FT (Figure 8). Plasmids were transformed into E. coli strain 
DH5a, and ampicillin-resistant colonies were screened for the presence of the FT gene. 
The glucoamylase-FT fusion junction was sequenced using a primer that hybridizes to the 
glucoamylase sequence (5-CCG-AGT-ATC-GTG-GCT-ACT-GGC-GGC-3' (Seq ID No. 
13)). The plasmid of the correct construct was purified using Qiaex matrix (Qiagen). 

Example 9 
Screening of FT Transformants 
Transformants were grown on agar plates of MM containing 5% xylose for 8 days at 37°C 
to generate spores. The spores were harvested in 0.1% Tween/20% glycerol and stored at 
-70°C until used as stocks. 
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Spor s (0.3 ml) from stocks w r inoculated into 250 ml baffl d flasks containing 50 mis of 
CSL + 5% xylose; cultures were grown at 37°C ( 130 rpm for 2 days. This culture was used 
as the inoculum for the production medium. 



Two production media, Clofine Special (Example 7) and Sheftone N (Example 7), were 
used for screening transformants. A 10% inoculum of CSL culture was inoculated into 50 
mis of production medium in 250 ml baffled flasks. Cultures were grown at 32°C, 130 rpm 
for 4 days, at which time supematants were assayed. 

The protocol for the FT assay is provided below and is available from Cytel, San Diego, CA. 
The assay measures the FT enzymatic activity and catalyzes the following reaction: GDP- 
fucose + N-acetyllactosamine — > GDP + fucosyl N-acetyllactosamine. The fucosylated 
product was separated from GDP-fucose on an ion exchange column resin (AG1 X 8, 
chloride form). Use of 14 C-labeled substrate (GDP-[ 14 C] fucose) allowed measurement of 
the fucosylated product by liquid scintillation counting. 



Fucosvltransf erase Assay Protocol 
Cocktail Mix 

70 u! [ 14 C]GDP-fucose (ca. 1.5 x 10 6 cpm) Evaporate with N 2 
5.5 ul cold GDP-fucose (6 mg/ml) 
35 ul 1 M Na Cacodylate, pH 6.0 
13 ul 1 M MnCI 2 

130 ul 100 mM N-acetyllactosamine 
400 ul dH ? Q 
584 ul total 



Final Concentration 

0.3 nmoles 
4.0 nmoles 
50 mM 
20 mM 
20 mM 



Assay 

45 ul cocktail mix 
5 ul diluted enzyme 
50 ul total per tube 

Preheat tube with 45 ul reaction mix, 37°C for ca. 5 min 
Add 5 ul diluted enzyme; time reactions 
Incubate 10 min at 37°C 
Terminate with 1 ml dH 2 0 

Run through AG1-X8 (chloride form) column (1.5 ml of 50:50 slurry) 

Wash 1 time with 1 ml dH 2 0 

Count 



- Run negative control 
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- Count 45 ul cocktail mix control for determining sp cific activity 

Example 10 

Media Optimization for Increasing FT Expression 
Various parameters were tested for increasing FT expression in transformant KK17. 
Temperature, pH, incubation time, agitation speed, inoculum size, and soy protein source 
were among the variables tested. Several of these parameters affected the level of FT 
expression. For example, the optimum agitation speed for FT activity was about 180-220 
rpm (Figure 9). In addition, pH affected expression, having an optimum value at pH 7.0 
(Figure 10). Doubling the inoculum size from 10 to 20% increased activity at pH 7.0, 220 
rpm from 580 U per liter to 690 U per liter. 

Example 1 1 

Recombinant Baculovirus Expression System 
Construction of Recombinant Baculovirus 

The recombinant baculovirus expressing ST was constructed as described in Ichikawa et 
al. (1992, J. Am. Chem. Soc. 114:9283-9298). The gene was fused to the human insulin 
leader sequence, which allows the glycosyltransferase to be secreted. The baculovirus 
construction made for FT expression was similar, except the cDNA was cloned into the 
vector pVL1 393, as illustrated in Fig. 11. 

Stock cultures of Spodoptera frugoperda insect cells (Sf9ll; Gibco/BRL) were grown at 
27.5°C in 2-liter Fembach flasks containing 1 liter of Sf900 II serum-free media 
(Gibco/BRL). Circularized pVL1393-FT or pBlueBac-ST (2 mg) was co-transfected with 
linearized BaculoGold viral DNA (0.25 mg; PharMingen) into Sf9ll cell monolayers using the 
transfection protocol of Invitrogen. The media containing recombinant virus particles was 
harvested after 5 days incubation at 27°C. To isolate single viral clones, 2.5 x 10 3 Sf9ll 
C eiis were infected with serial 5-fold dilutions (12 each) of virus in 96-well limiting dilution 
plates (O'Reilly, D.R., Miller, L.K., Luckow, V.A., in Baculovirus Expression Vectors, A 
Laboratory Manual, W.H. Freeman and Co., 1992). Virus from wells exhibiting cell lysis at 
the end-point dilution were amplified by infecting 5 x 10 s cells with 0.2 pfu per cell in 24-well 
dishes. The amplified clones were then tested for activity: 5 x 10 5 cells were infected with 
3-5 pfu per cell and dishes were incubated for 48 hrs. All of the clones were positive for 
activity, and one clone was used for the generation of the viral stock (passage 2), which 
was made in monolayer as follows: 3.0 x 10 7 cells were absorbed to a T225 cm 2 tissue 
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cultur flask and infected with 0.2 pfu virus per cell for 1 hr at room t mperature. Fresh 
Sf900ll media (30 ml) was then added, and flasks were incubated for 48 hrs at 27.5°C. 
Virus was harvested by c ntrifugation (3700 rpm, 20 min). Supernatants were stored at 4C 
and used to make virus stocks for large scale protein production. Virus titer was 
determined by limiting dilution assay and was typically ca. 3 x 10 s pfu per ml. 

Protein Production 

Virus stocks (passage 3) were produced in shake flasks as follows: Sf9ll cells were grown 
to 8 x 10 6 cells per ml in 2 L Fernbach flasks containing 0.5 L of Sf900ll media. Virus from 
passage 2 was added at 0.2 pfu per cell; another 0.5 L of fresh Sf900ll media was added, 
and flasks were incubated in shaking incubators (27.5°C, 1 30 rpm). Virus was harvested at 
48 hrs by centrifugation (3700 rpm, 20 min), and supernatants were checked for 
contamination and stored at 4°C. Virus titers were determined by limiting dilution assay 
and were typically 2 x 10 9 pfu per ml. 

For protein production, Sf9 II cells (1 L) were grown at 27.5°C in 2 L cotton-stoppered 
Fembach flasks to ca. 8 x 10 6 cells per mi. Cells were gently centrifuged (1000 rpm, 5 min) 
in sterile 1 liter bottles; supernatants were carefully dumped and virus was added to ca. 5-7 
pfu per cell (ca. 25-50 ml). Fresh Sf900ll medium was immediately added to restore the 
original volume, and cells were shaken at 27.5°C. Supernatants were harvested by 
centrifugation (3700 rpm; 15*) at 48 hrs post-infection for ST or 72 hrs for FT. 

Example 12 

Comparison of A. niger Expression System to Baculovirus Expression System 
Baculoviruses are insect-specific viruses that receive much interest for their use as 
recombinant protein expression systems. It is suggested to have a high rate and yield of 
expression, so it was desirable to compare it to the A. niger system described in this patent. 

ST and FT activities in the two expression systems are shown in Table 2, which are the 
maximum activities over time. 

Table 2 

Maximum FT and ST Activities in a Recombinant Baculovirus Expression System 
as Compared t th A. niger Expression Syst m 

System ST (U/Litert FT (U/Litert 
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Baculovirus 
A niger 



140 
1700 



50 
600 



The recombinant ST baculovirus produced 140 U/liter, while the A niger transfonmant 
produced 1700 U/liter. Likewise, the FT recombinant baculovirus expressed 50 U/liter, 
while FT A. niger transformant produced 600 U/liter. Both genes expressed 12-fold higher 
activities in A. n/ger than the corresponding baculovirus system. 

The foregoing are presented by way of example only and should not be construed as a 
limitation to the scope of permissible claims. 

Having described the preferred embodiments of the present invention, it would appear to 
those ordinarily skilled in the art that various modifications may be made to the disclosed 
embodiments, and that such modifications are intended to be within the scope of the 
present invention. 

All references are expressly incorporated herein by reference. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Kodama, Katherine 

(B) STREET: 180 Kimball Way 

(C) CITY: So. San Francisco 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 94080 

(G) TELEPHONE: (415) 742-7536 

(H) TELEFAX: (415) 742-7217 

(A) NAME: Ward, Michael 

(B) STREET: 180 Kimball Way 

(C) CITY: So. San Francisco 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 94080 

(G) TELEPHONE: (415) 742-7536 

(H) TELEFAX: (415) 742-7217 



(ii) TITLE OF INVENTION: Expression of Glycosyltransf erase in 
Aspergillus 

(iii) NUMBER OF SEQUENCES: 13 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
ATGGGAAGAC TCGAATTCAC TGATT 25 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GCCGGACACT GGTCTAGAGA GGCTCAC 27 
(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TCGACCGCGA CGGTGACCGA CACCTGGCGG AAGCGCCTAC ACTTACTCCA ATGGGAAGAC 60 
TCG 63 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
AATTCGAGTC TTCCCATTGG AGTAAGTGTA GGCGCTTCCG CCAGGTGTCG GTCACCGTCG 60 
CGG 63 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TCGACCGCGA CGGTGACCGA CACCTGGCGG CTACACTTAC TCCAATGGGA AGACTCG 57 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AATTCGAGTC TTCCCATTGG AGTAAGTGTA GCCGCCAGGT GTCGGTCAC C GTCGCGG 57 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
C T AGC AAAGC T AC A CTTACTCCAA TGGGAAGACT CG 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AATTCGAGTC TTACCATTGG AGTAAGTGTA GCTTG 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
C TAGCAAGCG CCTAC AC TT A CTCCAATGGG AAGACTCG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
AATTCGAGTC TTACCATTGG AGTAAGTGTA GGC GCTTG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
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CTAGCTAGCA AGCGCGGATC CCCTAGGCCA 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
CCCGGTGACC TCAGGT GAAC CAAGCC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) strandedness: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
CCGAGTATCG TGGCTACTGG CGGC 
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What Is Claim d Is : 

1 . A fusion DNA sequence encoding a fusion polyp ptide comprising from the 5' nd 
of said fusion DNA sequence: 

a) DNA encoding a signal peptide functional in Aspergillus', 

b) DNA encoding a secreted polypeptide or portion thereof normally secreted 
from Aspergillus; 

c) optionally DNA encoding a cleavable linker polypeptide; and 

d) DNA encoding a desired gly cosy (transferase. 

2. A fusion DNA sequence of Claim 1 wherein the desired gly cosy [transferase is 
selected from the group consisting of sialyltransferase, galactosyltransferase and 
fucosyltransferase. 



3. A fusion DNA sequence of Claim 1 wherein the signal peptide is selected from the 
group consisting of signal peptides from glucoamylase, a-amylase, and aspartyl 
protease from Aspergillus species and signal peptides from Trichoderma 
cellobiohydrolase I and II, and endoglucanase I and II. 

4. A fusion DNA of Claim 3 wherein the signal peptide is the signal peptide from A. 
niger war. awamori glucoamylase. 

5. A fusion DNA of Claim 1 wherein the secreted polypeptide or portion thereof is 
glucoamylase from Aspergillus. 

6. A fusion DNA of Claim 5 wherein the secreted polypeptide is the full length mature 
glucoamylase from A. niger war. awamori. 

7. A fusion DNA of Claim 5 wherein the secreted polypeptide is a portion comprising 
50% of glucoamylase from A. niger var. awamori. 



8. 



A fusion DNA of Claim 1 wherein the signal peptide comprises the signal peptide of 
A, niger var. awamori glucoamylase, the secreted polypeptide or portion thereof 
comprises glucoamylase from A. niger var. awamori, and the desired transferase is 
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selected from the group consisting of siaiyltransferase, galactosyltransferase and 
fu cosy Itransf eras . 

9. An expression vector for transforming an Aspergillus host cell comprising DNA 
sequences encoding regulatory sequences functionally recognized by the 
Aspergillus host, including promoter and transcription and translation initiation 
sequences operably linked to the 5' end of the fusion DNA sequence of Claim 1 and 
transcription stop sequences and polyadenylation sequences operably linked to the 
3' end of said fusion DNA sequence. 

10. An Aspergillus containing an expression vector of Claim 9. 



11. A fusion polypeptide comprising from the 5* end: 

a) an amino acid sequence encoding a signal peptide functional in Aspergillus; 

b) an amino acid sequence encoding a secreted polypeptide or portion thereof 
normally secreted from Aspergillus] 

c) optionally an amino acid sequence encoding a cleavable linker; and 

d) an amino acid sequence encoding a desired glycosyltransferase. 

12. A fusion polypeptide of Claim 1 1 wherein the desired glycosyltransferase is selected 
from the group consisting of siaiyltransferase, galactosyltransferase and 

fucosy Itransf erase. 

13. A fusion polypeptide of Claim 1 1 wherein the signal peptide is selected from the 
group consisting of signal peptides from glucoamylase, a-amylase, and aspartyl 
protease from Aspergillus species and signal peptides from Trichoderma 
cellobiohydrolase I and II, and endoglucanase I and II. 

14. A fusion polypeptide of Claim 13 wherein the signal peptide is the signal peptide 
from A. nigervar. a wamori glucoamylase. 



15. 



A fusion polypeptide of Claim 11 wherein the secreted polypeptide or portion thereof 
is glucoamylase from Aspergillus. 
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16. A fusion polypeptid of Claim 15 wherein the s cr t d polypeptid is the full I ngth 
matur glucoamylase from A. niger var. awamori. 

17. A fusion polypeptide of Claim 15 wherein the secreted polypeptide is a portion 
comprising 50% of glucoamylase from A, niger var. awamori. 

18. A fusion polypeptide of Claim 1 1 wherein the signal peptide comprises the signal 
peptide of A. niger var. a wamori glucoamylase, the secreted polypeptide or portion 
thereof comprises glucoamylase from A. niger var. awamori, and the desired 
glycosy I transferase is selected from the group consisting of sialy (transferase, 
galactosyltransferase and fucosyltransferase. 
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pGADVST and pGADSTK 
(A) Isolation of linkers 

linker without a kex2 protease cleavage site 

5 ' CTAGCAAGC TAC ACTTACTCCAATGGGAAGACTCG 3 ' 
5 1 AATTC GAGTC TTAC C ATTGGAGTAAGTGTAGC TTG 3 ' 



linker with a kex2 protease cleavage site 

5' CTAGCAAGC GCCTACACTTACTCCAATGGGAAGACTCG 3' 
5 1 AATTCGAGTC TTACCATTGGAGTAAGTGTAGGCGCTTG 3 ' 
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